About

Overview

The cran-search project provides a database for searching packages for the R programming language in the Comprehensive R Archive Network (CRAN) repository. The data are obtained with the tools::CRAN_package_db() function, and only a few columns are kept to support searching for a topic of interest.
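As a rough illustration of the column selection described above, the sketch below keeps a handful of fields from a package data frame. The helper name select_search_columns, the capitalized field names, and the reading of "update" as the snapshot date are assumptions made for this example, not the project's actual code; a small offline sample stands in for the CRAN snapshot so the block runs without network access.

```r
# Hypothetical helper: keep only the columns used for searching.
# The capitalized field names are assumed to match what
# tools::CRAN_package_db() returns; adjust them if they differ.
select_search_columns <- function(db) {
  wanted <- c("Package", "Version", "License", "Title", "Description",
              "Date", "Depends", "Imports", "URL")
  keep <- intersect(wanted, names(db))  # tolerate missing fields
  out <- db[, keep, drop = FALSE]
  names(out) <- tolower(names(out))
  # Assumption: "update" records the date the snapshot was taken.
  data.frame(update = Sys.Date(), out, stringsAsFactors = FALSE)
}

# Small offline sample standing in for the CRAN snapshot.
sample_db <- data.frame(
  Package = c("pkgA", "pkgB"),
  Version = c("1.0.0", "0.2.1"),
  Title = c("Tools for Data Analysis", "Plots with Style"),
  License = c("MIT + file LICENSE", "GPL-3"),
  Maintainer = c("A", "B"),
  stringsAsFactors = FALSE
)
search_db <- select_search_columns(sample_db)
print(names(search_db))

# With network access, the real snapshot could be used instead:
# search_db <- select_search_columns(tools::CRAN_package_db())
```

Using intersect() means a field that is absent from the input is skipped rather than raising an error, which seems a reasonable default if the set of fields changes over time.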

The following table gives a brief summary of the structure of the data frame built from the packages available on CRAN: the number of rows and columns, and the most frequent words longer than 3 or 4 characters in the title, description, and license columns. The percentages are relative to the combined count of the three words shown. A deeper investigation of the data is left to the reader.

update      structure    information
2025-04-12  column       update, package, version, license, title, description, date, depends, import, url
2025-04-12  n_column     10
2025-04-12  n_row        22328
2025-04-12  NA           TRUE
2025-04-12  title        frequency: (1) data 3693 (48.09%), (2) analysis 2282 (29.71%), (3) with 1705 (22.20%)
2025-04-12  description  frequency: (1) data 15041 (42.26%), (2) package 10281 (28.89%), (3) with 10267 (28.85%)
2025-04-12  license      frequency: (1) license 6316 (50.14%), (2) file 5773 (45.83%), (3) apache 507 (4.03%)
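
The word frequencies in the table could be computed along the following lines. This is a minimal sketch, not the project's implementation: the helper name top_words, the tokenization on non-letter characters, and the sample titles are all assumptions. The share is taken over the top words only, which matches the arithmetic of the percentages in the table.

```r
# Hypothetical helper: the n most frequent words longer than
# `min_chars` characters, with each word's share of the top-n total.
top_words <- function(text, min_chars = 3, n = 3) {
  # Lowercase, then split on anything that is not a letter a-z.
  words <- unlist(strsplit(tolower(text), "[^a-z]+"))
  words <- words[nchar(words) > min_chars]
  counts <- sort(table(words), decreasing = TRUE)
  top <- counts[seq_len(min(n, length(counts)))]
  data.frame(
    word = names(top),
    count = as.integer(top),
    share = round(100 * as.integer(top) / sum(top), 2),
    stringsAsFactors = FALSE
  )
}

# Made-up titles used only to exercise the helper.
titles <- c("Tools for Data Analysis",
            "Data Plots with Style",
            "Analysis of Survey Data")
result <- top_words(titles)
print(result)
```

On a real snapshot, the same helper could be applied to the title, description, and license columns in turn.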

Author

Author
name     url
author   Bruno Faria
website  https://brunofariadf.github.io/
github   https://github.com/brunofariadf/

Project
name     url
main     cran-search
review   News
license  MIT